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PREFACE 


The  calculation  of  project  duration  times  and  project 
costa  by  means  of  network  models  has  become  Increasingly 
popular  within  the  last  few  years.  These  models,  which  go 
by  such  names  as  PERT  (^ogram  Evaluation  Review  Technique), 
PEP  (Program  Evaluation  Procedure),  Critical  Path  Scheduling, 
Project  Cost  Curve  Scheduling,  and  others,  have  the  common 
feature  that  uncertainties  In  Job  times  are  either  Ignored 
or  handled  outside  the  network  analysis,  usually  by  replacing 
each  distribution  of  Job  times  by  Its  expected  value. 

This  Project  RAND  Memorandum  describes  a  feasible 
computational  method  that  seems  to  yield  a  fairly  good 
approximation  to  the  expected  duration  time  of  a  project 
whose  Individual  Job  times  are  random  variables. 

Another  possible  area  of  application  of  this  method 
would  be  to  communication  networks  whose  components  are 
subject  to  random  delay  times. 
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SUMMARY 

A  practicable  oiethod  of  computing  an  approximation  to 
the  expected  critical  path  length  In  a  PERT  network  In  which 
arc  lengths  are  random  variables  Is  proposed.  The  usual 
practice  In  this  situation  Is  simply  to  replace  each 
distribution  by  Its  expected  value,  thereby  obtaining  a 
deterministic  problem  whose  solution  provides  an  estimate  g 
of  the  ejqpected  critical  path  length  e.  This  estimate  Is 
always  optimistic:  g  ^  e.  The  proposed  method  yields  a 
usually  better,  and  never  worse,  approximation  f  satisfying 
g  <  f  ^  e. 
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EXPECrrO  CRITICAL  PATH  LENGTHS  IN  PERT  fJETWORKS 


1 ■  INTRODUCTION 

In  this  note  we  propose  a  method  for  calculating  an 
appiToxlmatlon  to  the  expected  length  of  a  critical  path  In  a 
PERT  network  In  which  arc  lengths  are  random  variables.  The 
usual  practice  In  this  situation  (as  in  other  linear  prograim- 
mlng  problems  with  random  objective  functions)  is  simply  to 
replace  each  distribution  by  Its  expected  value,  thereby 
obtaining  a  deterministic  problem  whose  solution  provides  an 
estimate  g  of  the  expected  critical  path  length  e.  This 
estimate  Is  always  optimistic:  g  <  e.  We  shall  describe  a 
way  of  computing  a  usually  better,  and  never  worse,  approxi¬ 
mation  f;  this  approximation  satisfies  g  <  f  <  e.  The 
proof  of  this  Inequality  Is  easy,  once  the  procedure  for 
obtaining  f  has  been  described,  but  the  basic  Idea  seems 
not  to  have  been  noted  before. 

The  PERT  model  [4]  is  reviewed  In  Sec.  2.  Section  3 
Introduces  the  notation  used  throughout  the  remainder  of  the 
paper  and  outlines  sufficient  conditions  on  the  underlying 
distributions  In  order  that  the  inequality  g  <  f  <  e  hold. 
The  procedure  for  calculating  f  Is  presented  in  Sec.  4, 
and  a  proof  of  the  Inequality  given.  Section  5  contains  some 
nvunerlcal  examples  for  which  the  numbers  g,  f,  e  are 
compared.  The  concluding  Sec.  6  sketches  an  application  to 
the  random  shortest— path  problem  In  more  general  networks. 
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2.  PERT  NETWORKS 

A  PERT  network  Is  a  directed,  acyclic  network.  In  the 
PERT  model,  such  a  network  Is  viewed  as  representing  a  partial 
ordering  of  the  many  Individual  "Jobs"  (arcs  of  the  network) 
that  together  comprise  some  "project"  (the  complete  network), 
the  partial  ordering  coming  from  the  requirement  that  all 
Inward— pointing  Jobs  at  a  node  must  be  finished  before  any 
outward-^polntlng  Job  at  the  node  can  be  started.  Any 
partially  ordered  set  can  be  so  represented  by  Introducing 
dummy  arcs  appropriately.  We  may  also  add  to  the  network. 

If  necessary,  two  distinguished  nodes,  called  origin  and 
terminal,  respectively,  together  with  appropriate  outward¬ 
pointing  arcs  at  the  origin  and  inward— pointing  arcs  at  the 
termlnalj  in  such  a  way  that  every  node  Is  contained  In  some 
path  (directed  chain)  from  origin  to  terminal.  The  nodes  of 
the  network  are  thought  of  as  "events"  in  time;  if  Jobs  are 
assigned  duration  times,  i.e.,  arcs  are  assigned  lengths,  the 
length  of  a  longest  path  from  the  origin  to  any  other  node  of 
the  network  represents  the  (earliest)  time  of  occurrence  of 
that  node.  The  duration  time  of  the  entire  project  is  then 
the  length  of  a  longest  path  from  origin  to  terminal.  Such  a 
path  is  called  critical.  We  also  use  this  terminology  in 
speaking  of  a  longest  path  from  the  origin  to  some  other  node. 

Since  the  network  is  acyclic,  it  is  a  trivial  matter  to 
compute  critical  path  lengths  from  the  origin  to  all  other 
nodes,  given  a  fixed  assignment  of  arc  lengths.  But  if  arc 
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lcngth0  are  random  varlablee.  It  Is  usually  not  feasible  to 
compute,  precisely,  the  expected  critical  path  lengths  from 
the  origin.  For  example.  If  job  times  are  Independently 
distributed  and  If  each  can  assume  one  of  two  values,  an 
optimistic  and  a  pessimistic  estimate,  say,  and  If  there  are 
m  Jobs,  then  2®  deterministic  problems  need  to  be  solved. 
Since  a  typical  PERT  network  may  Involve  hundreds  or  thousands 
of  arcs,  the  precise  'ialculatlon  of  expected  critical  path 
lengths  would  of  course  be  out  of  the  question. 

3«  ASSUMPTIONS  AND  NOTATTQM 

Suppose  given  a  PERT  network  as  described  In  Sec.  2, 
with  an  origin  and  terminal,  index  the  nodes  l,2,...,n  In 
such  a  way  that  1  la  the  origin,  n  the  terminal,  and  If 
there  Is  an  arc  from  node  1  to  node  J,  then  1  <  J. 

(Such  a  numbering  can  be  secured  as  follows.  The  origin, 
with  only  outwardr-polntlng  arcs.  Is  numbered  1.  Delete  all 
arcs  from  the  origin  and  search  for  nodes  In  the  new  network 
that  have  only  outward-pointing  arcs;  number  these  2,5,...,k 
In  any  order.  In  general,  delete  all  arcs  pointing  out  from 
newly  numbered  nodes,  look  for  nodes  with  only  outward¬ 
pointing  arcs,  and  number  these  consecutively  In  any  order. 
Eventually  the  terminal  Is  numbered  n.  See  Pig.  5.1  for  an 
example . ) 

We  also  Index  the  arcs  l,2,...,m  in  any  order  and  let 
a  range  over  these  Indices. 


Pig.  5.1 


For  each  node  1  we  let  denote  the  bundle  of 

inward— pointing  arcs  at  node  1.  (Thus  Is  always  the 

empty  set.  In  the  example  of  Pig.  5.I,  ^  «  (l),  «  (2,5), 

and  so  on.)  We  allow  Joint  probability  distributions  on  arc 
lengths  within  a  bundle,  the  distributions  on  distinct  bundles 
being  assumed  Independent.  Thus,  corresponding  to  each 
bundle  B  of  arcs,  say 

(5.1)  B  *  (o^^, . . . ,otj^) , 

there  Is  a  nonnegative*  random  vector  of  aro-length  variables 

(5.2)  tg  = 


¥ 


The  assumption  of  nonnegativity  is  not  essential  In 
this  or  the  following  section. 


assumed  to  have  a  finite  probability  distribution.  Taking 
the  usual  liberty  with  functional  notation,  we  let 

(3-3)  pCtg) 

represent  this  distribution,  U*respectlve  of  the  bundle. 

An  Important  special  case  of  these  assumptions  Is  that 
In  which  Individual  arc  lengths  are  distributed  Independently. 
Since  we  do  not  need  this  more  stringent  requirement,  and 
since  one  can  conceive  of  practical  problems  in  which  arc 
lengths  within  a  bundle  are  correlated,  with  Independence 
between  bundles  (for  example,  commur loatlon  networks  In  which 
nodes  are  subject  to  random  delay  times),  we  have  stated  our 
assumptions  as  generally  as  possible. 

For  a  given  assignment 

(3.4)  t  =  (ti,t2,...,tjjj) 

of  lengths  to  all  arcs  of  the  network, 

(5-5)  p(t)  «  p(t  )p(tp  )...p(t„  ) 

Is  the  probability  of  this  assignment,  and 
(3.6)  i^(t) 

denotes  the  length  of  a  longest  or  critical  path  from  node  1 
to  node  1,  given  the  assignment  t.  Then 


Is  the  expected  length  of  a  critical  path  to  node  1. 

Observe  that  for  a  given  assignment  t,  the  length 
i^(t)  of  a  critical  path  to  node  i  does  not  depend  on  the 
values  assigned  to  arcs  of  the  bundles  since 

no  path  from  1  to  i  uses  any  of  these  arcs.  Accordingly, 
if  we  extend  the  notation  (5*2)  to  arbitrary  sets  S  of  arcs, 

(3.8)  tg  =  (t^  » •  •  •  ^  ^ 

12  s 

then  (3.7)  may  be  written  as 

(3.9)  ®1  *  5  -^i^^A.^’ 

^i 

where 

1 

(3-10) 

lC“  1 

(3.11)  p(t^^)  = 

Let  t  be  a  component  of  the  bundle  vector  tg.  Then 
oc 

the  expected  length  of  arc  a  is 
(J.12)  I  P(‘b>*c.- 


For  the  arc  lengths  (3*12),  we  let  g^  denote  the  length  of 


a  critical  path  to  node  1.  The  may  be  computed  by  the 

recursion 

«!  -  0, 

(5.13)  I  •  *  * 

-  max  ^1-1)'  ^  ■  2, .  ..,n. 

Here  we  are  assuming,  and  without  loss  of  generality,  that 
consists  of  arcs  from  nodes  1,2,...,!  —  1  to  1,  and 
that,  at  the  1— th  stage  In  applying  (5.15),  these  arcs  ai^e 
numbered  1,2,...,!  —  1,  respectively.  Of  course.  If  the 
arc  k  (from  k  to  l)  does  not  exist  In  the  network,  then 
the  corresponding  term  gj^  ^k  (5*13)  1b  Ignored.  This 
Is  equivalent  to  setting  such  equal  to  —  00  . 

4.  THE  NUMBERS  f^ 

The  computation  (3*13)  1b  an  eaBy  taBk,  even  for  large 
PERT  networks  having  thousands  of  arcs.  Moreover,  It  Is  not 
hard  to  see  (and  will  be  a  by-product  of  the  proof  at  the  end 
of  this  section)  that  the  Inequalities 

(4.1)  g^  ^  ^  "  l#2,...,n, 

hold.  We  now  describe  an  analogous  but  somewhat  more  complex 
computation,  which  appears  to  yield  much  closer  approximations 
to  the  expected  critical  path  lengths. 

Define  the  numbers  f^  recursively  by 


(4.2) 


[  -  2  p(tg  )  max(f^  +  1  -  2,...,n. 

As  in  (5*13) i  it  is  assvuned  In  (4.2)  that  in  computing 
the  arc  n\imbering  has  been  selected  so  that 

(4.3)  tg^  ■  ( ^2*  *  *  * * 

with  appropriate  components  set  equal  to  —  oo  . 

Before  proceeding  further,  we  Illustrate  (3. 13)  and  (4.2) 
by  means  of  a  small  numerical  example.  Let  the  network  be 
that  of  Pig.  4.1  below  and  suppose  that  each  arc  Independently 
assumes  the  lengths  0  or  1,  each  with  probability  1/2. 


Applying  (3.13)  first,  we  have 


«1 


-  0, 


-  Si  + 


1 


Si,  “ 


84  - 


(Si  + 

(gg  +  +  |) 


-  1, 


We  next  use  (4.2)  to  obtain 


-  0, 

^2  ■  ■  ?> 

^  max  (f^  +  o,r^  +  0)  +  J  max 

+  ^  max  (f^  +  l,r^  +  0)  +  J  max 

-  J(|-  +  f  +  1  +  J)  - 

f 4  ■  max  (fg  +  0»fj  +  0)  +  ^  max 

+  ^  max  (fg  +  +  0)  +  ^  max 

•  ?(§  +  ^  +  I  +  ^)  - 


(fl  + 
(fl  + 

(fg  + 
(^2  ■*■ 


O^fg  +  1) 

l>f2  +  1) 

+  1) 
+  1) 


An  examination  of  the  2^  ■  52  cases  Involved  In  calculating 
the  expected  critical  path  lengths  e^^  shows  that  -  e 
In  this  example. 

Note  that  the  real  difference  between  the  f— calculation 
and  the  c- calculation  In  the  arc— Independent  case  Is  that> 
roughly  speaking j  the  former  Increases  exponentially  with 
bund  '  slze^  while  the  latter  Increases  exponentially  with 
network  size.  Thus  the  f— calculation  could  well  be  feasible 
In  large  networks . 
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V*  now  show  that  for  any  PKRT  natwork,  the  Inequalities 

(^•^)  «i  1  1  l-l,2,...,n,  . 

are  satisfied.  Clearly  (4.4)  holds  for  1-1,  and  we 
proceed  by  Induction.  Thus  we  assume  (4.4)  for  1,2,... ,l 
and  consider  the  numbers  We  show  first  that 

‘"•s)  8i+i  i  ^t+r 

By  (4.2).  i.ttlng  B  .  .  (1,2,. ...1),  w,  have 

o 

^1+1  "  J  P(V  ““  <^1  +  h>***Bf4  +  t.), 
or 

(4.6)  .  I  max  (p(t3)(fj  +  tj), .  ..,p(t3)(fj  +  t^)). 

B 

Interchanging  the  operations  of  summing  and  taking  the  maximum 
In  (4.6)  yields  the  inequality 

('*•7)  fi+;^2inax[l  P(t3)(fj  +  t^),...,  S  p(t3)(f^  +  t,)] 

Hence  from  (5.12),  noting  that  f^,...,,f^  do  not  depend 
on  tg,  we  have 

(^•8)  ^  max  [f^+  'F^,...,f^  +  t^]. 

Consequently  the  Induction  assumption  and  (3.I3)  imply 
(^•9)  ^1+1  ^ 


establishing  (4.5). 
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The  proof  will  be  eoiQ)lete<l  by  showing  that 
(4.10)  i 

To  simplify  the  notation,  lot  A  -  U  Bl,  c  -  R, 

k»l  ^  Ic*i  ^ 

®  "  ®1+1  "  (1*2* -..aI).  Then  from  (5.9)*  (5.11),  and  the 
equation 

^l+l(V  -  ““  +  ‘l-’-'-Vt*)  +  ‘iJ. 


we  have 


’l+l  ■  J  f  P(‘a)p(V  ““  'V*a)  +  h'-"'V‘A>  +  *l)< 

'A  ‘'B 


*1+1  -f  P(‘b)  f  “«  fP(pA)('l(‘A)  +  Pl)'---< 


*B  “  ‘a 


Pt»A)('*l(*A)  +  ‘l)l 


It  follows  that 


(4.12)  ^  2  p(tg)  max  [  2  +  t, . 

B  *A 

I  p<*a)^i(»a)  +  *il 

A 

Consequently,  using  (5.9)  again,  we  have 

(^•15)  P(tg)  max  [e^  +  t^,  +  t^]. 


The  Induction  assumption  and  (4.2)  now  Imply 
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(4.14)  « 1+1  2  2:  p(tg)  max  [f^  +  t^, 

This  verifies  (4.10)  and  thus  proves  (4.4). 

It  should  be  pointed  out  that  the  inequalities  Si  <  ®i 
hold  under  the  weaker  assumption  that  all  arc  lengths  are 
distributed  Jointly,  as  is  readily  verified.  But  the 
assumption  of  bundle  Independence  is  essential  for  (4.4),  as 
the  following  example  shows.  Let  the  network  be  that  of 
Pig.  4.2  below,  with  the  distribution  of  arc  lengths  given  in 
the  accompanying  table. 


Pig.  4.2 

Then  e^  -  g^  =  1,  but  f^  »  5/4. 

We  also  remark  that  all  of  the  various  logical  possi¬ 
bilities  Implicit  in  (4.4)  can  occur.  It  is  not  difficult 
to  construct  examples  in  which 

(4.15)  8n  <  'n  <  «n' 

<  'n  -  'n’ 


(4.16) 


-1>- 


(^.17) 

(^.18) 


«n  -  'n  < 

•  ®n* 


For  Instance,  the  netfcjrlc  of  Fig.  t.s 
independent  distribution 


satisfies  (4.17)..  g3  -  f^  -  1  <  -  5/4. 


li _ SOME  NUMERICAL  EXAMPT.Rg 

In  this  section  we  present  sosie  numerical  examples  for 

purposes  of  comparing  the  numbers  g„,f„,e„.  In  each  of  the 

examples,  we  have  assumed  the  same  unlfomn  distribution  on 
each  arc,  distributions  on  distinct  arcs  being  Independent. 
The  numbers  recorded  beside  a  typical  arc  are  the  lengths 
that  can  be  taken  on.  In  Examples  2  and  3,  the  computation 

of  e„  was  carried  out  on  a  computer,  using  a  program  written 
by  R.  Clasen. 
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Example  3. 


gg  -  2.50,  fg  -  2.92,  eg  -  2.95* 

6.  MINIMAL  PATHS  IN  GENERAL  NEWORKS 

If  one  1b  Inteirested  in  expected  minimal  path  lengths  In 
acyclic  directed  networks,  the  theory  of  Sec.  k  remains  valid 
by  replacing  "max"  by  "min"  and  reversing  all  Inequalities. 
Suppose  we  depart  from  the  PERT  context  completely  and  consider 
general  directed  networks  whose  arc  lengths  are  nonnegative 
random  variables.  What  can  be  said  about  expected  maximal  or 
minimal  path  lengths  from  some  node  to  all  others  in  this  case? 

Since  the  problem  of  determining  a  max jjnal— length  simple 
path  between  two  given  nodes  In  the  deterministic  case  Is  no 


The  warehousing  problem  with  random  costs  can  be  formu¬ 
lated  In  this  way  [3J*  Also,  a  number  of  stochastic  dynamic 
programming  problems  have  such  formulatlona. 


-16- 


longer  easy,  being  equivalent  to  a  travel In^alesman  problem, 
we  restrict  attention  to  the  mlnlmal-^ath  case,  for  which  the 
character  of  the  deterministic  problem  Is  still  much  the 
same  [1,2]. 

Although  It  can  be  shown  that  the  analogues  of  equations 
(4.2)  have  a  solution,  and  In  fact,  under  fairly  mild 
conditions  on  the  network  and  the  distributions  Involved,  a 
unique  solution.  It  Is  no  longer  true  that  the  analogous 
Inequalities  e^^  ^  f^^  hold  for  this  solution.  The  following 
example  demonstrates  this: 


In  Pig.  6.1  we  are  assuming  that  the  arc  lengths  written 
adjacent  to  an  arc  are  equlprobable,  with  distributions  on 
distinct  arcs  Independent.  The  unique  solution  of  the 
equations 
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fg  -  ^  min  (f^  +  4,f^  +  0)  +  y  min  +  4,f^  +  1) 

+  J  min  (f^  +  6,fj  +  0)  +  J  min  (f^  +  6,f^  +  1), 

^  min  (f^  +  l,t^  +  1)  +  J  min  (f^  +  l,f2  +  5) 

^  min  (f^  +  10,f2  +  1)  +  J  min  +  10,f2  +  5), 


fl  -  0,  fg  -  -  4^/^. 

the  expected  minimal  path  lengths  from  node  1  are 


for  this  exaaqple. 

The  Inequality  falls  to  hold  In  networks  with  cycles 
because  the  distribution  of  minimal  path  lengths  to  beginning 
nodes  of  the  bundle  of  arcs  pointing  Into  a  given  node  can 
depend  on  the  distribution  of  arc  lengths  for  the  bundle. 
However,  ai^roxlmatlons  f£  that  still  satisfy 

(6.1)  ®1  1  1 


can  be  calculated  by  the  following  device.  First  assign  each 
arc  Its  expected  length  and  compute  the  numbers  g^  repre¬ 
senting  the  minimal  path  length  frcmt  1  to  1  under  this 
assignment.  Next  renxjmber  the  nodes  according  to  Increasing 


k*«plng  the  origin  numbered  1»  delete  all  arcs  that 

lead  from  hlghei>-ouabered  nodes  to  l»lo«er"-nunbered  nodes, 
restricting  the  probability  dlstrlbvi.'f**^®**®  renalnlng 

arcs.  The  suh-network  left  Is  aisycL---!®  *****  numbers  e|, 
^1*  satisfy 

(6.2)  ®i  <  <  8i* 

Now  e^  <  e|,  since  the  e|  cone  rit‘^  a  sub-network  of  the 

original  network.  But  note  that  holds  In  the 

corresponding  inequality  g^  ^  g[.  a?®"**  **®  have 

(6.3)  ®1  <  «!  <  **1  <  Sj- 


which  yields  (6.1). 
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